TREC Blog and TREC Chem: A View from the Corn Fields
نویسندگان
چکیده
This year the Blog Track contained two tasks: Top Stories Identification and Faceted Blog Distillation Tasks. Our submissions for both tasks are described below. In this, our first entry into the blog track, we explore various strategies (latent Dirichlet relevance model, URL based ranking, query expansion etc.) for both tasks. We first indexed the blog data with Lucene and identified occurrences of Headline URLs in the permalink documents (which included the content of the posts as well as the side bars of the web pages). Text windows (+/800 characters including HTML code) surrounding the occurrences were harvested. The four runs submitted for the first task and the two for the second are described below.
منابع مشابه
BIT at TREC 2010 Blog Track: Faceted Blog Distillation
This paper presents the work done for the TREC 2010 faceted blog distillation task. As the approach used in TREC 2009, a mixture of language models based on global representation is employed to rank the entire blogs by relevance and facets. The parameters in our approach are adjusted according to the experimental results in TREC 2009. In addition, we make use of the results evaluated in TREC 20...
متن کاملFEUP at TREC 2009 Blog Track: Temporal Evidence in the Faceted Blog Distillation Task
This paper describes the participation of FEUP, from the University of Porto, in the TREC 2009 Blog Track. FEUP participated in the faceted blog distillation task with work focused on the use of temporal features available in the new TREC Blogs08 collection. The approach presented in this paper uses the temporal information available in most individual posts to amplify (or reduce) each post’s s...
متن کاملOn the TREC Blog Track
The rise of blogging as a new grassroots publishing medium and the many interesting peculiarities that characterise blogs compared to other genres of documents opened up several new interesting research areas in the information retrieval field. The Blog track was introduced in 2006 as part of the renowned Text REtrieval Conference (TREC) evaluation forum, to drive research on the blogosphere an...
متن کاملUniversity of Lugano at TREC 2008 Blog Track
We report on the University of Lugano’s participation in the Blog track of TREC 2008. In particular we describe our system for performing opinion retrieval and blog distillation.
متن کاملFEUP at TREC 2010 Blog Track: Using h-index for blog ranking
This paper describes the participation of FEUP, from the University of Porto, in the TREC 2010 Blog Track. FEUP participated in the baseline blog distillation task with work focused on the use of link features available in the TREC Blogs08 collection. The approach presented in this paper uses the link information available in most individual posts to amplify each post’s score. Blog scores, and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009